Community Finding: Partitioning Considered Harmful

نویسندگان

  • Fergal Reid
  • Aaron McDaid
  • Neil Hurley
چکیده

Considering a clique as a conservative definition of community structure, we examine how partitioning algorithms interact with cliques. We show that on a wide range of empirical networks, from different domains, significant numbers of cliques are split across different partitions by popular algorithms. We examine the largest connected component of the subgraph formed by retaining only edges in cliques, and apply partitioning strategies that explicitly minimise the number of cliques split. We conclude that, due to the connectedness of many networks, any partitioning community finding algorithm must fail to return at least some significant structure. Moreover, contrary to traditional intuition, strong ties and cliques frequently do cross community boundaries.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Haplotype Block Partitioning and tagSNP Selection under the Perfect Phylogeny Model

Single Nucleotide Polymorphisms (SNPs) are the most usual form of polymorphism in human genome.Analyses of genetic variations have revealed that individual genomes share common SNP-haplotypes. Theparticular pattern of these common variations forms a block-like structure on human genome. In this work,we develop a new method based on the Perfect Phylogeny Model to identify haplo...

متن کامل

Short Random Walks for Community Discovery in Social Networks

The study of networks is an active area of research due to its capability of modeling many real world complex systems. One such interesting property to investigate in any typical network is the community structure which is the division of networks into groups. The study of community structure in networks is closely related to the ideas of graph partitioning in graph theory. Finding an exact sol...

متن کامل

An employee transporting problem

An employee transporting problem is described and a set partitioning model is developed. An investigation of the model leads to a knapsack problem as a surrogate problem. Finding a partition corresponding to the knapsack problem provides a solution to the problem. An exact algorithm is proposed to obtain a partition (subset-vehicle combination) corresponding to the knapsack solution. It require...

متن کامل

اندازه‌گیری تأثیر سلامت بر رشد اقتصادی

Introduction: Economic growth in the literature is considered as a function of labor, capital, education level, and labor productivity. However, it can be affected by mental, physical, and emotional health. Accordingly, this paper aimed to estimate the impact of health on economic growth by using the production function method. Method: the research population, in this cause-effect study, consi...

متن کامل

A Greedy Community-Mining Algorithm Based on Clustering Coefficient

A community in a large, real-life network, such as the World Wide Web (Web), has been broadly defined as a group of nodes that are densely linked with each other, while being sparsely linked with the rest of the nodes. In the last 2-3 years, community-mining in such networks has emerged as a problem of great practical significance. This problem has been framed in at least two different versions...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010